Corpus Linguistics: Readings in a Widening Discipline

نویسنده

  • Rob Malouf
چکیده

Not that long ago, all linguistics was corpus linguistics. For much of the twentieth century, though, changes in theoretical fashion put linguists for whom collections of texts are the primary object of study at the margins of the field. Now things are shifting again, and a new generation of linguists with a wakening interest in linguistic data is returning to the field's corpus-based roots. What they are finding is that technological and methodological advances have quietly revolutionized corpus linguistics. In Corpus Linguistics: Readings in a Widening Discipline, editors Geoffrey Sampson and Diana McCarthy have put together a volume aimed at filling in some of what has happened in corpus linguistics while no one was watching. This collection of 43 reprinted papers, with original publication dates ranging from 1965 to 2002, includes a few of the usual suspects and a number of more-often-cited-than-read classics, making it an ideal source for an upper division course on corpus linguistics, or to supply what new corpus linguists (or their teachers) wish they had learned in grad school. Beyond these core contributions, however, the editors have wisely included an idiosyncratic selection of more-obscure papers guaranteed to stir the imagination of anyone with a professional interest in language. Papers on, for example, second language teaching, the discourse properties of Internet Relay Chat, and " non-indigenous minority languages " (such as South Asian languages spoken in the UK) show some of the possibilities that corpus-based methods have to offer beyond their philological foundations. While presented in chronological order, the papers in this volume for the most part fall into three classes. The largest is made up of papers on corpus design and methodology , such as Nelson Francis on the construction of the Brown corpus, Burnage and Dunlop on the British National Corpus, and B ¨ ohmová and Hajičová on the Prague Dependency Treebank. The second group are papers on the descriptive analysis of corpora. One, an excerpt from Charles Fries's The Structure of English, predates computational corpus analysis, but most of the rest are quantitative studies of English grammatical features. This group includes papers on cleft and pseudo-cleft constructions (Collins), that vs. null complementizers (Rissanen), and the use of terms of abuse (McEnery et al.). The third class of papers addresses technical issues in computational linguistics. Some of the classics in this group include Gale and Church on smoothing corpus counts, Hindle and Rooth on prepositional …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Do We Need Discipline-Specific Academic Word Lists? Linguistics Academic Word List (LAWL)

This corpus-based study aimed at exploring the most frequently-used academic words in linguistics and compare the wordlist with the distribution of high frequency words in Coxhead’s Academic Word List (AWL) and West’s General Service List (GSL) to examine their coverage within the linguistics corpus. To this end, a corpus of 700 linguistics research articles (LRAC), consisting of approximately ...

متن کامل

Gearing the Discursive Practice to the Evolution of Discipline: Diachronic Corpus Analysis of Stance Markers in Research Articles’ Methodology Section

Despite widespread interest and research among applied linguists to explore metadiscourse use, very little is known of how metadiscourse resources have evolved over time in response to the historically developing practices of academic communities. Motivated by such an ambition, the current research drew on a corpus of 874315 words taken from three leading journals of applied linguistics in orde...

متن کامل

ACADEMIC WRITING REVISITED: A PHRASEOLOGICAL ANALYSIS OF APPLIED LINGUISTICS HIGH-STAKE GENRES FROM THE PERSPECTIVE OF LEXICAL BUNDLES

Lexical bundles are frequent word combinations that commonly appear in different registers. They have been the subject of much research in the area of corpus linguistics during the last decade. While most previous studies of bundles have mainly focused on variations in the use of these word combinations across different registers and a number of disciplines, not much research has been done to e...

متن کامل

Metadiscourse in Applied Linguistics and Chemistry Research Article Introductions

This study examined disciplinary rhetoric in research articles, focusing on different traditions in structuring text discourses from a metadiscourse-move analytic approach. The corpus consisted of 72 research article Introductions (RAIs): 36 in applied linguistics and 36 in chemistry. Swales’ CARS model (1990, 2004) and Hyland’s interpersonal model of metadiscourse (2005) were used as analytica...

متن کامل

“Based on the data in …” Cohesive markers in Results and Discussion Section of Research Articles

Cohesive frames are linguistic elements that precede the grammatical subject in the main clause. This study investigated the frequencies and communicative purposes of cohesive frame types in results and discussion section of research articles from 4 disciplines. To run this study, 40 results and discussion sections of research articles were selected from 4 disciplines, namely Applied Linguistic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Linguistics

دوره 32  شماره 

صفحات  -

تاریخ انتشار 2006